AITopics | local linearization

TD converges at a sublinear rate to the global optimum of the mean-squared projected Bellman error for policy evaluation.

artificial intelligence, machine learning, reinforcement learning, (12 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
North America > Canada (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

e3bc4e7f243ebc05d66a0568a3331966-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-10-2026, 20:46:07 GMT

feature representation, neural network, revision, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.83)

Add feedback

ProvablyEfficientNeuralEstimationofStructural EquationModel: AnAdversarialApproach

Neural Information Processing SystemsFeb-8-2026, 16:56:21 GMT

Structural equation models (SEMs) are widely used in sciences, ranging from economics topsychology,touncovercausal relationships underlying acomplex system under consideration and estimate structural parameters of interest. We study estimation in a class of generalized SEMs where the object of interest is defined as the solution to a linear operator equation.

artificial intelligence, arxivpreprintarxiv, machine learning, (15 more...)

Neural Information Processing Systems

Country: North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

Adversarial Robustness through Local Linearization

Neural Information Processing SystemsDec-25-2025, 00:36:35 GMT

Adversarial training is an effective methodology for training deep neural networks that are robust against adversarial, norm-bounded perturbations. However, the computational cost of adversarial training grows prohibitively as the size of the model and number of input dimensions increase. Further, training against less expensive and therefore weaker adversaries produces models that are robust against weak attacks but break down under attacks that are stronger. This is often attributed to the phenomenon of gradient obfuscation; such models have a highly non-linear loss surface in the vicinity of training examples, making it hard for gradient-based attacks to succeed even though adversarial examples still exist. In this work, we introduce a novel regularizer that encourages the loss to behave linearly in the vicinity of the training data, thereby penalizing gradient obfuscation while encouraging robustness. We show via extensive experiments on CIFAR-10 and ImageNet, that models trained with our regularizer avoid gradient obfuscation and can be trained significantly faster than adversarial training. Using this regularizer, we exceed current state of the art and achieve 47% adversarial accuracy for ImageNet with L-infinity norm adversarial perturbations of radius 4/255 under an untargeted, strong, white-box attack. Additionally, we match state of the art results for CIFAR-10 at 8/255.

adversarial robustness, gradient obfuscation, name change, (6 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.59)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.59)

Add feedback

Neural Temporal-Difference Learning Converges to Global Optima

Qi Cai, Zhuoran Yang, Jason D. Lee, Zhaoran Wang

Neural Information Processing SystemsOct-3-2025, 06:53:11 GMT

TD converges at a sublinear rate to the global optimum of the mean-squared projected Bellman error for policy evaluation.

arxiv preprint arxiv, function approximation, neural network, (12 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
North America > Canada (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

e3bc4e7f243ebc05d66a0568a3331966-AuthorFeedback.pdf

Neural Information Processing SystemsJun-1-2025, 08:15:26 GMT

artificial intelligence, machine learning, neural network, (18 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.43)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.30)

Add feedback

Reviews: Neural Proximal/Trust Region Policy Optimization Attains Globally Optimal Policy

Neural Information Processing SystemsJan-22-2025, 07:33:15 GMT

Originality: The authors apply the idea that overparametrization induces local linearization, which has been documented for supervised learning, and in another submission for TD learning. In particular, they decompose the error into two terms, one due to TD, and the other due to SGD, and incorporate them in the analysis of infinite-dimensional mirror descent. The insight that the previous previous analysis for TD could be generalised to a meta algorithm that includes both TD and SGD as particular cases is key. Related work is adequately cited, and differences with previous works are clearly stated, including differences with the sister submission [5]. Quality: The submission seems technically sound, and includes detailed proofs (I just skimmed through them). This is a complete piece of work.

architecture, optimization attain globally optimal policy, submission, (10 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.77)

Add feedback

Reviews: Adversarial Robustness through Local Linearization

Neural Information Processing SystemsJan-21-2025, 12:49:31 GMT

This paper suggests and experimentally validates a novel regularization method to enhaned adversarial robustness of a neural network image classifier. The proposed method is carefully motivated and introduced and extensively validated. The authors claim improved computational efficiency while (mostly) achieving state of the art performance in terms of adversarial robustness. No theoretical analysis is provided. The reviewers appreciated the work.

adversarial robustness, final version, local linearization, (1 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.34)

Add feedback

Adversarial Robustness through Local Linearization

Neural Information Processing SystemsOct-9-2024, 12:46:42 GMT

Adversarial training is an effective methodology for training deep neural networks that are robust against adversarial, norm-bounded perturbations. However, the computational cost of adversarial training grows prohibitively as the size of the model and number of input dimensions increase. Further, training against less expensive and therefore weaker adversaries produces models that are robust against weak attacks but break down under attacks that are stronger. This is often attributed to the phenomenon of gradient obfuscation; such models have a highly non-linear loss surface in the vicinity of training examples, making it hard for gradient-based attacks to succeed even though adversarial examples still exist. In this work, we introduce a novel regularizer that encourages the loss to behave linearly in the vicinity of the training data, thereby penalizing gradient obfuscation while encouraging robustness.

adversarial robustness, gradient obfuscation, local linearization, (4 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.62)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.62)

Add feedback

Adversarial Robustness through Local Linearization

Qin, Chongli, Martens, James, Gowal, Sven, Krishnan, Dilip, Dvijotham, Krishnamurthy, Fawzi, Alhussein, De, Soham, Stanforth, Robert, Kohli, Pushmeet

Neural Information Processing SystemsMar-19-2020, 02:17:23 GMT

Adversarial training is an effective methodology for training deep neural networks that are robust against adversarial, norm-bounded perturbations. However, the computational cost of adversarial training grows prohibitively as the size of the model and number of input dimensions increase. Further, training against less expensive and therefore weaker adversaries produces models that are robust against weak attacks but break down under attacks that are stronger. This is often attributed to the phenomenon of gradient obfuscation; such models have a highly non-linear loss surface in the vicinity of training examples, making it hard for gradient-based attacks to succeed even though adversarial examples still exist. In this work, we introduce a novel regularizer that encourages the loss to behave linearly in the vicinity of the training data, thereby penalizing gradient obfuscation while encouraging robustness.

adversarial robustness, gradient obfuscation, local linearization, (4 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.62)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.62)

Add feedback

Filters

Collaborating Authors

local linearization

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Neural Temporal-Difference Learning Converges to Global Optima

e3bc4e7f243ebc05d66a0568a3331966-AuthorFeedback.pdf

ProvablyEfficientNeuralEstimationofStructural EquationModel: AnAdversarialApproach

Adversarial Robustness through Local Linearization

Neural Temporal-Difference Learning Converges to Global Optima

e3bc4e7f243ebc05d66a0568a3331966-AuthorFeedback.pdf

Reviews: Neural Proximal/Trust Region Policy Optimization Attains Globally Optimal Policy

Reviews: Adversarial Robustness through Local Linearization

Adversarial Robustness through Local Linearization

Adversarial Robustness through Local Linearization